A Large-Scale Characterization of How Readers Browse Wikipedia

نویسندگان

چکیده

Despite the importance and pervasiveness of Wikipedia as one largest platforms for open knowledge, surprisingly little is known about how people navigate its content when seeking information. To bridge this gap, we present first systematic large-scale analysis readers browse Wikipedia. Using billions page requests from Wikipedia’s server logs, measure reach articles, they transition between these patterns combine into more complex navigation paths. We find that behavior characterized by highly diverse structures. Although most paths are shallow, comprising a single pageload, there much variety, depth shape vary systematically with topic, device type, time day. show commonly mesh external pages part larger online ecosystem, describe naturally occurring distinct targeted in lab-based settings. Our results further suggest abandoned low-quality pages. Taken together, insights contribute to understanding readers’ information needs allow improving their experience on Web general.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deriving a Large-Scale Taxonomy from Wikipedia

We take the category system inWikipedia as a conceptual network. We label the semantic relations between categories using methods based on connectivity in the network and lexicosyntactic matching. As a result we are able to derive a large scale taxonomy containing a large amount of subsumption, i.e. isa, relations. We evaluate the quality of the created resource by comparing it with ResearchCyc...

متن کامل

Search strategies of Wikipedia readers

The quest for information is one of the most common activity of human beings. Despite the the impressive progress of search engines, not to miss the needed piece of information could be still very tough, as well as to acquire specific competences and knowledge by shaping and following the proper learning paths. Indeed, the need to find sensible paths in information networks is one of the bigges...

متن کامل

Constructing Large-Scale Person Ontology from Wikipedia

This paper presents a method for constructing a large-scale Person Ontology with category hierarchy from Wikipedia. We first extract Wikipedia category labels which represent person (hereafter, Wikipedia Person Category, WPC) by using a machine learning classifier. We then construct a WPC hierarchy by detecting is-a relations in the Wikipedia category network. We then extract the titles of Wiki...

متن کامل

Visualizing large-scale human collaboration in Wikipedia

Volunteer-driven large-scale human-to-human collaboration has become common in the Web 2.0 era. Wikipedia is one of the foremost examples of such large-scale collaboration, involving millions of authors writing millions of articles on a wide range of subjects. The collaboration on some popular articles numbers hundreds or even thousands of co-authors. We have analysed the co-authoring across en...

متن کامل

WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia

We present WIKIREADING, a large-scale natural language understanding task and publicly-available dataset with 18 million instances. The task is to predict textual values from the structured knowledge base Wikidata by reading the text of the corresponding Wikipedia articles. The task contains a rich variety of challenging classification and extraction sub-tasks, making it well-suited for end-to-...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on The Web

سال: 2023

ISSN: ['1559-1131', '1559-114X']

DOI: https://doi.org/10.1145/3580318